Artist Attribution via Song Lyrics

نویسنده

  • Michael Mara
چکیده

Song lyrics, separated from the audio signal of their song, still contain a significant amount of information. Mood and meaning can still be conveyed effectively by a pure textual representation. There has even been somewhat successful previous work on genre classification from song lyrics[7]. Building on previous work, we seek to build an artist attribution system for song lyrics. This task is in the same vein as classic author attribution tasks, which often are trained and evaluated on extremely large datasets[8]; providing more data per author than it is possible to get for most songwriters. In order to focus our task, we focus only on rap, as the songwriter and performer are usually the same, and there is a heavy emphasis on distinctive forms of lyricism. We actually enshrine that first assumption in our statement of the classification task: given a textual representation of the lyrics of a rap, return the name of the artist who raps it. This is a limitation we will have to live with for now, there does not exist any large public database that provides ghostwriting information for rappers. Potential use cases for such a classifier would be for detecting misattributed songs in a music library or as part of an auto-tagger in a music management system, along with other uses of author-attribution systems. A lyric-only classifier could also be used in an ensemble method that includes audio-only classifiers. Following previous work[2], we initially attempt to distinguish between 4 prolific rappers (Eminem, Nas, Jay Z, and Nicki Minaj) before expanding the classification task to encompass more artists, testing thoroughly on a 12-artist dataset, and eventually testing on over 300 rappers at once.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Lyric Jumper: A Lyrics-Based Music Exploratory Web Service by Modeling Lyrics Generative Process

Each artist has their own taste for topics of lyrics such as “love” and “friendship.” Considering such artist’s taste brings new applications in music information retrieval: choosing an artist based on topics of lyrics and finding unfamiliar artists who have similar taste to a favorite artist. Although previous studies applied latent Dirichlet allocation (LDA) to lyrics to analyze topics, LDA w...

متن کامل

Song Genre and Artist Classification via Supervised Learning from Lyrics

Motivation The amount of raw data available online has increased dramatically over the past few years; in order for us to maintain the usability of this data we must develop effective ways to efficiently and automatically organize it. For our CS 224N final project, we chose to develop a classifier that classifies songs into genres and/or artists based solely on their lyrics. We primarily focuse...

متن کامل

Quantifying Lexical Novelty in Song Lyrics

Novelty is an important psychological construct that affects both perceptual and behavioral processes. Here, we propose a lexical novelty score (LNS) for a song’s lyric, based on the statistical properties of a corpus of 275,905 lyrics (available at www.smcnus.org/lyrics/). A lyric-level LNS was derived as a function of the inverse document frequencies of its unique words. An artist-level LNS w...

متن کامل

Using Deep Learning to Annotate Karaoke Songs

Karaoke is a game in which players sing over pre-recorded instrumental backing tracks. To help the singer, lyrics are usually displayed on a video screen. The synchronization between the lyrics display and the song record, often done manually, is a tedious and time-consuming task. Automation of the annotation of karaoke songs can help save time and effort. In this thesis we use the representati...

متن کامل

Multi-modal Analysis of Music: A large-scale Evaluation

Multimedia data by definition comprises several different types of content modalities. Music specifically inherits e.g. audio at its core, text in the form of lyrics, images by means of album covers, or video in the form of music videos. Yet, in many Music Information Retrieval applications, only the audio content is utilised. Recent studies have shown the usefulness of incorporating other moda...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014